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We improve previously proposed conditions each measure of entanglement 
has to satisfy. We present a class of entanglement measures that satisfy these 
conditions and show that the Quantum Relative Entropy and Bures Metric 
generate two measures of this class. We calculate the measures of entangle- 
ment for a number of mixed two spin 1/2 systems using the Quantum Relative 
Entropy, and provide an efficient numerical method to obtain the measures 
of entanglement in this case. In addition, we prove a number of properties of 
our entanglement measure which have important physical implications. We 
briefly explain the statistical basis of our measure of entanglement in the case 
of the Quantum Relative Entropy. We then argue that our entanglement 
measure determines an upper bound to the number of singlets that can be 
obtained by any purification procedure. 



It was thought until recently that Bell's inequalities provided a good criterion for sep- 
arating quantum correlations (entanglement) from classical ones in a given quantum state. 
While it is true that a violation of Bell's inequalities is a signature of quantum correlations 



pletely separate quantum from classical correlations a new criterion was needed. This also 
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I. INTRODUCTION 




states violate Bell's inequalities So, in order to com- 



initiated the search into the related question of the amount of entanglement contained in a 
given quantum state. There are a number of 'good' measures of the amount of entanglement 
for two quantum systems in a pure state (see for an extensive presentation). A 'good' 
measure of entanglement for mixed states is, however, very hard to find. In an important 
work Bennett et al [Q] have recently proposed three measures of entanglement (we will dis- 
cuss the entanglement of formation and distillation in more detail later in this paper). Their 
measures are based on concrete physical ideas and are intuitively easy to understand. They 
investigated many properties of these measures and calculated the entanglement of forma- 
tion for a number of states. More recently. Hill and Wootters have proposed a closed form 
for the entanglement of formation for two spin 1/2 particles [Q. Uhlmann's recent work 
implies that the entanglement of formation can also be calculated numerically in an efficient 
way for those cases that are not analytically known [Q. 

We have recently shown how to construct a whole class of measures of entanglement 
PJ^, and also imposed conditions that any candidate for such a measure has to satisfy 
1^. In short, we consider the disentangled states which form a convex subset of the set 
of all quantum states. Entanglement is then defined as a distance (not necessarily in the 
mathematical sense) from a given state to this subset of disentangled states (see Fig. 1). An 
attractive feature of our measure is that it is independent of the number of systems and their 
dimensionality, and is therefore completely general, @J3]- present here two candidates 
for measuring distances on our set of states and prove that they satisfy improved conditions 
for a measure of entanglement (the third condition presented here is an improvement over 
the one given in 0). 

It should be noted that in much the same way we can calculate the amount of classical 
correlations in a state. One would then define another subset, namely that of all product 
states which do not contain any classical correlations. Given a disentangled state one would 
then look for the closest uncorrected state. The distance could be interpreted as a measure 
of classical correlations. In addition to many analytical results we also explain how to 
calculate efficiently using numerical methods our measure of entanglement of two spin 1/2 
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particles. We present a number of examples and prove several properties of our measure 
which have important physical consequences. To illuminate the physical meaning behind 
the above ideas we present a statistical view of our entanglement measure in the case of the 
Quantum Relative Entropy ||^. We then relate our measure to a purification procedure and 
use it to define a reversible purification. This reversible purification is then linked to the 
notion of entanglement through the idea of distinguishing two classes of quantum states. We 
also argue that the measure of entanglement generated by the Quantum Relative Entropy 
that we propose gives an upper bound for the number of singlet states that can be distilled 
from a given state. We find that in general the distillable entanglement is smaller than the 
entanglement of creation. This result was independently proven by Rains for Bell diagonal 
states using completely different methods 

The rest of the paper is organized as follows. Section II introduces the basis of purifica- 
tion procedures, conditions for a measure of entanglement and our suggestion for a measure 
of entanglement. We also prove that the Quantum Relative Entropy and the Bures Met- 
ric satisfy the imposed conditions and can therefore be used as generators of measures of 
entanglement. We compute our measure explicitly for some examples. In Section III we 
introduce a simple numerical method to compute our measure of entanglement numerically 
and we apply it to the case of two spin 1/2 systems. We present a number of examples of 
entanglement computations using the Quantum Relative Entropy. In Section IV we present 
a statistical basis for the Quantum Relative Entropy as a measure of distinguishability be- 
tween quantum states and hence of amount of entanglement. Based on this, in Section V 
we derive an upper bound to the efficiency (number of maximally entangled pairs distilled) 
of any purification procedure. We also show how to extend our measure to more than two 
subsystems. 



II. THEORETICAL BACKGROUND 
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A. Purification Procedures 



There are three different ingredients involved in procedures aiming at distiUing locally a 
subensemble of highly entangled states from an original ensemble of less entangled states. 

1. Local general measurements (LGM): these are performed by the two parties A and 
B separately and are described by two sets of operators satisfying the completeness 



Ai ® Bj = J2iAi0 which is again a complete general measurement, and 

obviously local. 

2. Classical communication (CC): this means that the actions of A and B can be corre- 
lated. This can be described by a complete measurement on the whole space A + B 
and is not necessarily decomposable into a sum of direct products of individual oper- 
ators (as in LGM). If pab describes the initial state shared between A and B then the 
transformation involving 'LGM-I-CC' would look like 



3. Post-selection (PS) is performed on the final ensemble according to the above two pro- 
cedures. Mathematically this amounts to the general measurement not being complete, 
i.e. we leave out some operations. The density matrix describing the newly obtained 
ensemble (the subensemble of the original one) has to be renormalized accordingly. 
Suppose that we kept only the pairs where we had an outcome corresponding to the 
operators Ai and Bj, then the state of the chosen subensemble would be 



relations J2i A\Ai — 1 and Y,j B^Bj — 1. The joint action of the two is described by 



Hpab) = a Pab 




(1) 



where J2i AjAiBjB^ = 1, i.e. the actions of A and B are 'correlated'. 



A, ® B, PAB Aj ® Bj 



(2) 



Pab 



TT{Ai ® Bi PAB Aj ® Bj) 



where the denominator provides the necessary normalization. 
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A manipulation involving any of the above three elements or their combination we shall 
henceforth call a purification procedure. It should be noted that the three operations de- 
scribed above are local. This implies that the entanglement of the total ensemble can- 
not increase under these operations. However, classical correlations between the two sub- 
systems can be increased, even for the whole ensemble, if we allow classical communica- 
tion. A simple example confirms this. Suppose that the initial ensemble contains states 
|0a) ® (|0_b) + \Ib))/V^- The correlations (measured by e.g. von Neumann's mutual in- 
formation between A and B are zero. Suppose that B performs measurement of his 
particles in the standard 0, 1 basis. If 1 is obtained, B communicates this to A who then "ro- 
tates" his qubit to the state \1a)- Otherwise they do nothing. The final state will therefore 
be 

P = ^(|0a)(0a| ® \0b){0b\ + ® \Ib){Ib\) , (3) 

where the correlations are now In 2 (i.e. nonzero). So, the classical content of correlations 
can be increased by performing local general measurements and classically communicating. 

An important result was proved for pairs of spin-1/2 systems in 0: All states that are 
not of the form p^s = J2iPiPA ® Pb; where J2iPi = ^ and Pi > for all i, can be distilled to 
a subensemble of maximally entangled states using only operations 1, 2 and 3. (The states 
of the above form obviously remain of the same form under any purification procedure). 
The local nature of the above three operations implies that we define a disentangled state 
of two quantum systems A and B as a state from which by means of local operations no 
subensemble of entangled states can be distilled. It should be noted that these states are 
sometimes called separable in the existing literature. We also note that it is not proven in 
general that if the state is not of this form then it can be purified. 
Definition 1. A state pab is disentangled iff 

PAB = Y^PiPA^PB ) (4) 

i 

where, as before, J2iPi = 1 and Pi > for all i. Otherwise it is said to be entangled. Note 
that all the states in the above expansion can be taken to be pure. This is because each 
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p* can be expanded in terms of its eigenvectors. So, in the above sum we can in addition 
require that p\ = p\ and = p^ for all i. This fact will be used later in this section and 
will be formalized further in Section III. 



In the previous section we have indicated that out of certain states it is possible to distill 
by means of LGM+CC+PS a subensemble of maximally entangled states (we call these 
states entangled). The question remains open about how much entanglement a certain state 
contains. Of course, this question is not entirely well defined unless we state what physical 
circumstances characterize the amount of entanglement. This suggests that there is no 
unique measure of entanglement. Before we define three different measures of entanglement 
we state three conditions that every measure of entanglement has to satisfy. The third 
condition represents a generalization of the corresponding one in 0. 

El. E{a) = iff cr is separable. 

E2. Local unitary operations leave E{a) invariant, i.e. E{(j) = E{Ua ® UbciU\ t/jj). 
E3. The expected entanglement cannot increase under LGM+CC+PS given by = \ 



where cxj = ViaV^ . 

Condition El ensures that disentangled and only disentangled states have a zero value 
of entanglement. Condition E2 ensures that a local change of basis has no effect on the 
amount of entanglement. Condition E3 is intended to remove the possibility of increasing 
entanglement by performing local measurements aided by classical communication. It is an 
improvement over the condition 3 in which required that E{J2iVi(^Vi) < E{a). This 



B. Quantification of Entanglement 



I.e. 




(5) 
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condition E3 is physically more appropriate than that in as it takes into account the fact 
that we have some knowledge of the final state. Namely, when we start with n systems all 
in the state a we know exactly which rrii = n x tr^cxi) pairs will end up in the state cxj after 
performing a purification procedure. Therefore we can separately access the entanglement in 
each of the possible subensembles described by (Xj. Clearly the total expected entanglement 
at the end should not exceed the original entanglement, which is stated in E3. This, of course, 
does not exclude the possibility that we can select a subensemble whose entanglement per 
pair is higher than the original entanglement per pair. We emphasise that if we assume that 
E{(7) is also convex (as it, indeed, is in the case of the Quantum Relative Entropy presented 
later in the paper) than E3 immediatelly implies that E{J2iViaV^) < E[a). On the other 
hand, convexity of E{a) and E{J2i yi^^^i) ^ E{a) do not imply E3, which also provides a 
reason for requiring E3 rather than the condition in 0. We now introduce three different 
measures of entanglement which obey E1-E3. 

First we discuss the entanglement of creation 0. Bennett et al define the entangle- 
ment of creation of a state p by 

E,{p):=mmY^p,S{p\) (6) 

i 

where S{pa) = — trp^ In pA is the von Neumann entropy and the minimum is taken over 
all the possible realisations of the state, pab = Hj'Pj\''Pj){'4^j\ with p\ = tTB{\ipi){ipi\)- The 
entanglement of creation satisfies all the three conditions E1-E3 0]. The physical basis of 
this measure presents the number of singlets needed to be shared in order to create a given 
entangled state by local operations. We will discuss this in greater detail in Section IV. It 
should also be added that progress has been made recently in finding a closed form of the 
entanglement of creation [Q. 

Related to this measure is the entanglement of distillation ||^. It defines the amount 
of entanglement of a state a as the proportion of singlets that can be distilled using a 
purification procedure (Bennett et al distinguish one and two way communication which 
give rise to two different measures, but we will not go into that much detail; we assume 
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the most general two way communication). As such, it is dependent on the efficiency of a 
particular purification procedure and can be made more general only by introducing some 
sort of universal purification procedure or asking for the best state dependent purification 
procedure. We investigate this in Section V. We now introduce our suggestion for a measure 
of an amount of entanglement. It is seen in Section V that this measure is intimately related 
to the entanglement of distillation by providing an upper bound for it. 

If V is the set of all disentangled states, the measure of entanglement for a state a is 
then defined as 



where D is any measure of distance (not necessarily a metric) between the two density 
matrices p and a such that E{a) satisfies the above three conditions E1-E3 (see Fig. 1). 

Now the central question is what condition a candidate for D{a\\p) has to satisfy in 
order for E1-E3 to hold for the entanglement measure? We present here a set of sufficient 
conditions. 

Fl. D[(t\\p) > with the equality saturated iS a — p. 

F2. Unitary operations leave D{a\\p) invariant, i.e. D{a\\p) = D{UaUmUpW). 
F3. D{trpa\\trpp) < D{a\\p), where trp is a partial trace. 

F4. T,PiD{ai/pi\\pi/qi) < J2D{ai\\pi), where pi = tr{ai), qi = tr{pi) and (7j = ViaV^ and 
Pi = ^iP^i (note that V^'s are not necessarily local). 

F5a. D{XiPi(^Pi\\Y.iPipPi) = EiD{Pi(^Pi\\PipPi): where Pi is any set of orthogonal 
projectors such that PiPj = SijPi. 

F5b. D{a ® Pa\\p ® Pa) = -D(cr||p) where Pa is any projector. 

Conditions Fl and F2 ensure that El and E2 hold; F2, F3, F4 and F5 ensure that E3 is 
satisfied. The argument for the former is trivial, while for the latter it is more lengthy and 
will be presented in the remainder of this section. 



E{a) :— min D{a 
peTi 




(7) 
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C. Proofs 



We claim that F2, F3, F4 and F5 are sufficient for E3 to be satisfied and hence need 
to prove that F2 — F5 =^ E3. If F2, F3 and F5b hold, then we can prove the following 
statement, 

Theorem 1. For any completely positive, trace preserving map $, given 
andj:V^Vi = 1, we have that D{^a\\<!?p) < D{a\\p). [] 

Proof. It is well known that a complete measurement can always be represented as a unitary 
operation+partial tracing on an extended Hilbert Space Ti^Tini where diniHn = n |TT,T^. 
Let {\i)} be an orthonormal basis in Hn and \a) be a unit vector. So we define, 

W = Y.V,^\t){a\ . (8) 

i 

Then, W^W = l^Pa where Pa = \a){a\, and there is a unitary operator U in Ti^Tin such 
that W = U{1® Pa) [lig. Consequently, 



U{A®Pa)U^ = Y,V,AV] , (9) 

SO that, 

tr2{U{A®Pa)U^] = Y.V^AV,^ . (10) 

i 

Now using F3, then F2, and finally F5b we find the following 

D{tr2{Uia ® Pa)W} \ \ tr2{U{p ® Pa)U^}) (11) 

<D{U{cT®Pa)U^\U{p®Pa)U^) (12) 

= D{a Pa\\p Pa) (13) 
= Dia\\p) . (14) 

This proves Theorem 1 □. 



^We frequently interchange the $ and notations for one another throughout this section. 
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Corollary. Since for a complete set of orthonormal projectors P, J2i Pi<^Pi is a complete 
positive trace preserving map, then 

Y,D{PiaPi\\PipPi)<D{a\\p) . (15) 

i 

(The sum can be taken outside as F5a requires that D(J2iPi(^Pi\\J2iPipPi) = 
J2tD{PiaPi\\P,pPi)). Now from F2, F3, F5b and eq.(|l5D we have the following 
Theorem 2. If (t^ = ViaV^ then J2 D{(^t\\pt) < D{(^\\p)- 

Proof. Equations (||) and (y) are introduced as in the previous proof. From eq. (^ we have 
that 

tr2{l®PiUiA(E)Pa)U^l(»Pi} = ViAV^ . (16) 
where Pi = Now, from F3, the Corollary and F5b it follows that 

D {tr2{l^ PiU{(r ® P^)U^1^ Pi}\\tr2{l^ PiU{p ® P^)U^1® Pi}) (17) 

i 

< Y,D{1^ PiU{a P^)W1^ Pi\\l^ PiU{p Pa)Wl^ Pi) (18) 

i 

<D{U{a®P,^U^\U{p®Pa)U^) (19) 

= D{a®P^\\p®Pc,) (20) 

= D{a\\p) . (21) 

This proves Theorem 2 □. 

From Theorem 2 and F4 we have, 

Y^p,DC-f-^)<D{a\\p) . (22) 
Pi Qi 

Now let E{a) = D{a\\p*), i.e. let the minimum of -D(cr||p) over all p be attained at p*. 
Then from eq. (^) 

E{a) := D{a\\p*) > ^p,D(^||^^^) > Y.P^ E{ct./p^) (23) 

Pi Qi 

and E3 is satisfied. Note that in all the proofs for D{a\\p) we never use the fact that the 
completely positive, trace preserving map $ is local. This is only used in the last inequality 
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of eg. p3|) where LGM (+CC+PS) maps disentangled states onto disentangled states. This 
ensures that p* is disentangled and therefore D{ai/pi\\p*/qi) > E{ai/pi). So, the need for 
local $ arises only in eq. (p3D; otherwise all the other proofs hold for a general $. Note also 
that one can prove, by the same methods, a slightly more general condition 

E3*. The expected entanglement of the initial state a"" = ai ® . . . ® o"„ cannot increase 
under LGM+CC+PS given by E V^Vi = 1, i.e. 

E{a^) = E{a, ® . . . ® (t„) > Y.tr{V,a^V^) E{V,a^V^ /triV.a'^V^)) . (24) 
However, in the following we will not make use of this generalization. 



D. Two Realisations of D{a,p) 

In this section we show that F1-F5 hold for the Quantum Relative Entropy and for the 
Bures metric, which as we have seen immediately renders them generators of a good measure 
of entanglement. 



1. Quantum Relative Entropy 

We first prove F1-F5 for the Quantum Relative Entropy, i.e. when D(cr||p) = S'(crllp) := 
Tr {(T(lno" — Inp)} (Note that the Quantum Relative Entropy is not a true metric, as it is 
not symmetric and does not satisfy the triangle inequality. In the next section the reasons for 
this will become clear. For further properties of the Quantum Relative Entropy see [13 -0.) 



Properties Fl and F2 are satisfied [16]. F3 follows from the strong subadditivity property of 
the von Neumann Entropy [|ll|,|l3-|l3 • Since J2 S{cri\\pi) = Y.PiS{ai/pi\\pi/qi)+Y.P-MPi/qi 
and I] Pi In ^ > (see []T^ for proof) F4 is also satisfied. Property F5 can be proved to 
hold by inspection [|TT|]. Now, a question arises as to why the entanglement is not defined 
as E{a) = miupgx) S{p\\a). Since the Quantum Relative Entropy is asymmetric this gives a 
different result to the original definition. However, the major problem with this convention 
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is that for all pure states this measure is infinite. Although this does have a sound statistical 
interpretation (see the next section) it is hard to relate it to any physically reasonable scheme 
(e.g. a purification procedure) and, in addition, it fails to distinguish between different 
entangled pure states. This is the prime reason for excluding this convention from any 
further considerations. The measure of entanglement generated by the Quantum Relative 
Entropy will hereafter be referred to as the Relative Entropy of Entanglement. 

Properties of the Relative Entropy of Entanglement 

For pure, maximally entangled states we showed that the Relative Entropy of Entanglement 
reduces to the von Neumann reduced entropy p. We also conjectured |^ that for general 
pure state this would be true. Now we present a proof of this conjecture. In short, our proof 
goes as follows: we already have a guess as to what the minimum for a pure state a should 
be-say, it is a disentangled state p* . Then we show that the gradient ^S'((t||(1 — x)p* +xp) 
for any p G is nonnegative. However, if p* was not a minimum the above gradient would 
be strictly negative which is a contradiction. Now we present a more formal proof ||2^ 
that applies to arbitrary dimensions of the two subsystems. An alternative proof that also 
applies to arbitrary dimensions will be given in section III. In the appendix we present 
a third proof that is restricted to two spin 1/2 systems but which can be generalized to 
arbitrary dimensions. 

Theorem 3. For pure states a = J2nin2 A/Pn7Pnll0niV'ni) (0n2V'n2 1 the Relative Entropy of 

Entanglement is equal to the Von Neumann reduced entropy, i.e. E{o-) = —J2nPn^^Pn- 

Proof. For a > 0, log a = /o°°^^y^' thus, for any positive operator A, log A = 

lo^^TW- Let f{x,p) = S{a\\{l-x)p*+xp). Then 

^(0, p) = - hm t,{^(M(l-^)p-+^p)-logp-)^ 
ox X 



tr(a/ (p* + t)-\p*-p)ip* + t)-'dt) 
Jo 

1- / tT{a{p* + t)-'p{p* +t)-')dt 
Jo 

1 - / tr((p* + t)-V(p* + t)-'p)dt (25) 
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Take p* — J2nPn\<Pn'4^n) {(pni^nl (this is our guess for the minimum). Then 

™i,n.2,«3,"4 

VPn2Pn3 1 0n2^n2 ) (^ns^ns I {Pn^ + ^)"^ |0n4^n4) (0n4^. 



^^(Pn + t) ^y/PnPn'{Pn' + t) ^|0nV'n) (^n'V'n'l ■ (26) 
n,n' 

Set g{p,q) — J^{p + t)~^y/pq{q + t)~^dt. Then it follows that g{p,p) — 1 and, for p < g, 

r°° 1 11 

g{p.q)^Vpq (—7-— 7) dt (27) 

JO p + t q + t q —p 

Vp^ 1 ^ ^oQ^ 
= log - . (28 

q-p P 

lemma < g{p,q) < I for all p, g G [0, 1]. 

proof. We know that g{p, q) = y/pq!^{p + t)~^{q + ty^dt. But, 

(p + t)(g + i)=pg + t(p + g)+i'>pg + 2iVpg + ^' = (VP? + i)' , (29) 

and so 

roo 

g{p, q)<s/pq (Vm + ty^dt = 1 . (30) 

t/ 

Let p — \oi){a\ <S> \P){P\ where jo;) = J2n'^n\<Pn) ^-nd (3 — Y^n^ni^n ^-re normalized vectors. 
Then 

1^(0, p) - 1 = -tril^ip* + t)-'a{p* + tr'dtp) 

= -'tl^( 9{Pni,Pn2)\(l>m'^m){(l>n2^n2\ 

ni,n2,n3,n4,?i5,ng 

C^na ^714 '^ns &n6l0n3V'n4)(0n5^™6l) 
= - H 9{Pni,Pn2)0'n2bn2dnJ)ni (31) 



ni,n2 



and 



l^(0,p)-l|< XI lOmll^nillanall^'nzl 



ni,n2 



= (El«n||&n|)'<El«nPEl^nP = l ■ (32) 
n n n 

13 



Thus it follows that ^{0,\a(3){a(3\) > 0. 

But any p E V can be written in the form p = I]j '^j|ct*/3*)(ct*/3*| and so |^(0,p) 
E^n%{0,\a^(3'){a^(3'\)>0. 



Proposition Let ^ E H have Schmidt decomposition pi 



\^) =J2VP^\^ni^n) (33) 

n 

and set cr = 1$) ($|. Then E{a) = — J2nPn^ogpn- 

Proof. S{a\\p*) = —J2nPn^ogpn SO it is sufficient to prove that S{a\\p) > S{(t\\p*) for all 
p E D. Suppose that S{a\\p) < S{a\\p*) for some p eV. Then, for < a; < 1, 

/(x, p) = I (1 - x)p* +xp)<{l- x)S{a\\p*) + xSia\\p) 

= (l-x)/(0,p)+x/(l,p) . (34) 



This implies 

/(x,p)-/(0,p) 



X 



</(l,p)-/(0,p)<0 . (35) 



This is impossible since ^(0,p) = limj^^o -^(^'^^^-^C^'^) > q. This therefore proves the above 
proposition □. 

Therefore we have shown that for arbitrary dimensions of the subsystems the entropy 
of entanglement reduces to the entropy of entanglement for pure states. This is, in fact, a 
very desirable property, as the entropy of entanglement is known to be a good measure of 
entanglement for pure states. In fact one might want to elevate Theorem 3 to a condition 
for any good measure of entanglement, i.e. 

E4: For pure states the measure of entanglement reduces to the entropy of entanglement, 
i.e. 

E{a) = -tr{aA\naA} , (36) 

with aA = trBicr} being the reduced density operator of one subsystem of the entangled 
pair. 
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However, in subsection 2 we will see that measures which do not satisfy E4 can nevertheless 
contain useful information. We will discuss this point later in this paper. 

We would like to point out another property of the relative entropy of entanglement that 
helps us find the amount of entanglement. It gives us a method to construct from a density 
operator a with known entanglement a new density operator a' with known entanglement. 
Theorem 4. If p* minimizes S{a\\p*) over p &T> then p* is also a minimum for any state 
of the form ax = — x)a + xp* . 
Proof. Consider, 

S{cTx\\p) - S{ox\\p*) = tr{cra;lnp* - ct^e In p} 

= — a;tr(cr In p) — (1 — a;)tr(p* In p) + a;tr(cr In p*) + (1 — a;)tr(p* In p*) 
= x{S{a\\p) - S{a\\p*)} + (1 - x)S{p*\\p) > (37) 

This is true for any p. Thus p* is indeed a minimum of ax □. For completeness we now 
prove here that E{a) is convex. Namely, 

Theorem 5. E{xiai + X2a2) < xiE{ai) + X2E{a2), where Xi + X2 = 1. 

Proof. This property follows from the convexity of the quantum relative entropy in both 

arguments ||TB| 



^(xicri + X2a2||xipi + X2P2) < a;iS'((Ti||pi) + X2S'((T2||P2) • (38) 

Now, 

E{xiai + X2O2) < S {xiai + X2a2 \ \xipl + X2P2) 
< XiS{ai\\pl) + X2S'(o-2||p2) 

= XiE{ai) + X2E{a2) , (39) 

which completes our proof of convexity □. This is physically a very satisfying property of 
an entanglement measure. It says that when we mix two states having a certain amount of 
entanglement we cannot get a more entangled state, i.e. succinctly stated "mixing does not 
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increase entanglement" . This is what is indeed expected from a measure of entanglement to 
predict. 

As a last property we state that the entanglement of creation Ec is never smaller than 
the Relative Entropy of Entanglement E. We will show later that this property has the 
important implication that the amount of entanglement that we have to invest to create 
a given quantum state is usually larger than the entanglement that you can recover using 
quantum state distillation methods. 
Theorem 6. E{a) < E^a) = minpev S{a\\p). 

Proof. Given a state a then by definition of the entanglement of creation there is a convex 
decomposition a = J2Pi<^i with pure states a, such that 



As the entanglement of creation coincides with our entanglement for pure states and as our 
entanglement is convex it follows that 



and the proof is completed □. 

The physical explination of the above result lies in the fact that a certain amount of 
additional knowledge is involved in the entanglement of formation which gives it a higher 
value to the Relative Entropy of Entanglement. This will be explained in full detail in 
section V. We add that the relative entropy of entanglement E{a) can be calculated easily 
for Bell diagonal states 0. Comparing the result to those for the entanglement of creation 
one finds that, in fact, strict inequality holds. In general, we have unfortunately found no 
"closed form" for the relative entropy of entanglement and a computer search is necessary 
to find the minimum p*, for each given a. However, we can numerically find the amount of 
entanglement for two spin 1/2 subsystems very efficiently using general methods independent 
of the dimensionality and the number of subsystems involved which are described in the next 
section. 



(40) 



(41) 
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2. Bures Metric 



Another distance measure that leads to a measure of entanglement that satisfies the 
conditions E1-E3 is induced by the Bures metric. However, it will turn out that it does not 
satisfy condition E4 and is therefore a less useful measure. In fact some people would say 
it is not a measure of entanglement at all, however, we believe that this very much depends 
on the questions one asks. 

We now prove F1-F5 for the Bures metric, i.e. when D{a\\p) = Db(o"||p) := 2 — 



transition probability). Property Fl follows from the fact that the Bures metric is a 
true metric and F2 is obvious. F3 is a consequence of the fact that Db does not in- 



PiqiF{ai/pi, pi/qi) = F{ai,pi), from where F4 immediately follows as qi G [0, 1]. F5 is seen 
to be true by inspection. As conditions F1-F5 are satisfied, it immediately follows that 
conditions E1-E3 are satisfied too. 

In the following present some properties of the Bures measure of entanglement £'^(o"). 
First we show that for pure states we do not recover the entropy of entanglement. 
Theorem 7: For a pure state = a\00) + f3\ll) one has 



Proof. To prove Theorem 7 we have to show that the closest disentangled state to a = 
under the Bures metric is given by p* = a^|00)(00| + To this end we 

consider a slight variation around p* of the form px = {1 — X)p* + Xp where p &T>. Now we 
need to calculate 




1/2 



is the so called fidelity (or Uhlmann's 



crease under a complete positive trace-preserving map |2^. We can also easily check that 



i?B(|^)(^|) = 4a^(l-a^) 



(42) 




(43) 



Using the fact that ^/o" = o" as a is pure we obtain 



d_ 

dX 



DBia\\px)\x=o 



dX 



4' 



;4 + /?4 + A((V^|p|V^)-l)U=o<0 



(44) 
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Using the closest state p* one then obtains eg. (^2]) . To obtain the entanglement of an 
arbitrary pure state one first has to calculate the Schmidt decomposition |^ and than by 
local unitary transformation transform the state to the form = a\00) + As local 

unitary transformations do not change the entanglement, we have therefore shown that the 
Bures measure of entanglement does not reduce to the entropy of entanglement for pure 
states. The proof presented here can be generalized to many dimensional systems but we 
do not state this generalization. 

In fact, it is now easy to see the following 
Corollary. The Bures measure of entanglement for pure states is smaller than the entropy 
of entanglement, i.e. for any pure state a 

(a) < -tr{(TA In cta} (45) 

Proof. One can see quickly that for a G [0, 1] 

Aa^{l- a^) < -a^ In - (1 - a^) ln(l - a^) (46) 

from which the Corollary follows. 

As the Bures measure of entanglement does not satisfy condition E4, i.e. does not reduce 
to the entropy of entanglement for pure states one might argue that it does not provide 
a sensible measure of entanglement. However, it should be noted that the Bures metric 
immediately gives an upper bound on the following very special purification procedure. 
Assume that Alice and Bob are given EPR pairs, but one pair at a time. Then they are 
allowed to perform any local operations they like, and then decide whether we keep the pair 
or discard it. Then, they are given the next EPR pair. The question is, how many pure 
singlet states they can possibly distill out of such a purification procedure. The answer is 
immediately obvious from condition E3. The best that Alice and Bob can do is to have one 
subensemble with pure singlets and all other subensembles with disentangled states. Then 
the probability to obtain a singlet is simply given by the Bures measure of entanglement for 
the initial ensemble. As this is smaller than the entropy of entanglement we have found the 
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nontrivial, though not very surprising result, that this restricted purification procedure is 
strictly less efficient than entanglement concentration described in |^8[ . 



A reasonable candidate to generate a measure of entanglement is the Hilbert-Schmidt 
Metric. Here we have that D{A\\B) = ||A - S|p := tr{A - B)^. Fl follows from the fact 
that — is a true metric, and F2 is obvious. F3 and F4 remain to be shown to hold. 
We also believe that there are numerous other nontrivial choices for D(A\\B) (by nontrivial 
we mean that the choice is not a simple scale transformation of the above candidates). Each 
of those generators would arise from a different physical procedure involving measurements 
conducted on a and p*. None of the choices could be said to be more important than any 
other a priori, but the significance of each generator would have to be seen through physical 
assumptions. To illustrate this point further, let us take an extreme example. Define, 



This measure therefore tells us if a given state a is entangled, i.e. when E{a) = 1, or 
disentangled, i.e. when E{a) = 0. We can call it the "indicator measure" of entanglement. 
It should be noted that this measure trivially satisfies conditions E1-E3. This shows that 
there are numerous different choices for and each is related to different physical 

considerations. We explain the statistical basis of the Relative Entropy of Entanglement in 
Section IV. The Relative Entropy of Entanglement is then seen to be linked very naturally 
to the notion of a purification procedure. First, however, we present an efficient numerical 
method to obtain entanglement for arbitrary particles. 



3. Other candidates 




If entanglement is calculated using this distance, then 
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III. NUMERICS FOR TWO SPIN 1/2 PARTICLES 



In order to understand how our program for calculating the amount of entanglement 
works, we first need to introduce one basic definition and one important result from convex 
analysis From this point onwards we concentrate on the Quantum Relative Entropy as 
a measure of entanglement although most of the considerations are of a more general nature. 
Definition 2. The convex hull {co{A)) of a set A is the set of all points which can be 
expressed as (finite) convex combinations of points in A. In other words, x G co{A) if and 
only if X has an expression of the form x = Ylk=iPkO'k where K is finite, Y.f=iPk = 1, and, 
for k = 1, . . . , K , pk > and Ofc G A. 

We immediately see that the set of disentangled states V is a convex hull of its pure 
states. This means that any state in V can be written as a convex combination of the form 
J2Pn\4'n'4^n){4'n'^n\- Howcvcr, there is now a problem in the numerical determination of the 
measure of entanglement. We have to perform a search over the set of disentangled states in 
order to find that disentangled state which is closest to the state a of which we want to know 
the entanglement. But how can we parametrize the disentangled states? We know that the 
disentangled states are of the form given by Definition 1. However, there the number of 
states in the convex combination is not limited. Therefore one could think that we have to 
look over all convex combinations with one state, then two states, then 1000 states and so 
forth. The next theorem, however, shows that one can put an upper limit to the number 
of states that are required in the convex combination. This is crucial for our minimization 
problem as it shows that we do not have to have an infinite number of parameters to search 
over. 

Carat heodory's theorem. Let A C R^. Then any x G co{A) has an expression of the 
form X = Y.n=i Pndn whcrc Y^n=i Pn = ^, and, for n = 1, . . . , + 1, p„ > and a„ G A. 

A direct consequence of Caratheodory's theorem is that any state in V can be decomposed 
into a sum of at most (dim(ifi) x dim(_f/'2))^ products of pure states. So, for 2 spinl/2 
particles there are at most 16 terms in the expansion of any disentangled state. In addition, 



20 



each pure state can be described using two real numbers, so that there are altogether at 
most 15 + 16 X 4 = 79 real parameters needed to completely characterize a disentangled 
state in this case. 

A random search over the 79 real parameters would still be very inefficient. However, 
we can now make use of another useful property of the relative entropy, which is the fact 
that it is convex. This means that we have to minimize a convex function over the convex 
set of disentangled states. It can easily be shown that any local minimum must also be a 
global minimum. Therefore we can perform a gradient search for the minimum (basically 
we calculate the gradient and then perform a step in the opposite direction and repeat this 
procedure until we hit the minimum). As soon as we have found any relative minimum 
we can stop the search, since this is also a global minimum. To make the gradient search 
efficient we have to chose a suitable parametrization. The parametrization that we use 
has the advantage that it also provides us with another proof of Theorem 3 which states 
that for pure states the Relative Entropy of Entanglement reduces to the von Neumann 
reduced entropy. We first explain the parametrization and then state the alternative proof 
for Theorem 3. The following results can easily be extended to two subsystems of arbitrary 
dimensions but for clarity we restrict ourselves to two spin 1/2 systems. 

Our aim is to find the amount of entanglement of a state a of two spin 1/2 states, i.e. we 
have to minimize tr{(j In cr — cr In p} for all p ^T>. Prom Caratheodory's theorem we know 
that we only need convex combinations of at most 16 pure states p\ to represent p &T>, i.e. 

16 

P-Y.pU®A ■ (47) 

i=l 

(Notice that we use pf instead of pi for convenience, so that here we require that Yh^i Pi — ^)- 
The parametrization we chose is now given by 



15 

Pi — sin cos with 0o 

j=i 



TT 



2 



(48) 



and 
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IV-i) = cosailO) + sina,e^''»|l) (49) 
= cosAlO) + sin/5ie^^'|l) 

All angles ai, l3i,(j)i,rii, fii can have arbitrary values, but due to the periodicity only the in- 
terval [0, 2tt] is really relevant. Numerically this has the advantage that our parameter space 
has no edges at which problems might occur. The program for the search of the minimum 
is now quite straightforward. The idea is that given a we start from a random p, i.e. we 
generate 79 random numbers. Then we compute S{(j\\p), as well as small variations of the 79 
parameters of p, to obtain the approximate gradient of ^(crl |p) at the point p. We then move 
opposite to the gradient to obtain the next p. We continue this until we reach the minimum. 
As explained before, a convex function over a convex set can only have a global minimum, 
so that the minimum value we end up with is the one and only. The method outlined above 
immediately generalizes to two subsystems of arbitrary dimension, however, the number of 
parameters rises quickly to large values which slows down the program considerably. 

Before we state some numerical results we now indicate an alternative proof of Theorem 
3 using Caratheodory's theorem and the parametrization given in eqs. (^Tf) - (^). For this 
proof we use the fact that we can represent the logarithm of an operator p by 

If 1 

Inp = (p \nz— (50) 

where the path of integration encloses all eigenvalues of p. We can now take the partial 
derivative of In p with respect to a parameter on which p might depend. 

91np 1 /, 1 dp 1 

— — ^ = (p In^ TT- • (51) 

d(j) 2m J zl - p d(f) zl - p ^ ^ 

Now, we have a given pure state 



a = a2|oo)(00| + aVl - a^{\00){ll\ + |11)(00|) + (1 - a^)|ll)(ll| (52) 

The suspected closest approximation to a within the disentangled states is given by 

p™„ = a2|00)(00| + (l-a2)|ii)(ii| . (53) 
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If we want to represent pmin using the parametrization given in eqs. (|47|) - (^91) then we find 
for these parameters cos^ 0i = ; 0^2 = /52 = f and zero for all other parameters. Using 
eq. (|5T| ) one can now calculate all the partial derivatives of the relative entropy around the 
point Pmin- It is easy, but rather lengthy, to check that these derivatives vanish and that 
therefore pmin is a relative minimum. This concludes the proof as a relative minimum of a 
convex function on a convex set is also a global minimum. 

After this additional proof of Theorem 3 we now state some results that we have ob- 
tained or confirmed with the program that implements the gradient search. We present four 
nontrivial states a for which we can find the closest disentangled state p that minimize the 
Quantum Relative Entropy thereby giving the Relative Entropy of Entanglement. Using 
the same ideas as for the proof of Theorem 3 in Eq. (|50| - ^3]) one can then prove that these 
are indeed the closest disentangled states. 
Example 1. 

cTi = A|$+)(<l>+| + (1 - A)|01)(01| (54) 
Pi = ^(1 - ^)|00)(00| + ^(1 - ^){|00)(11| + H.C.} + 

(1 - ^)'|01)(01| + ^|10)(10| + ^(1 - ^)|11)(11| (55) 
E(ai)= (A - 2) ln(l - ^) + (1 - A) ln(l - A) . (56) 

Here |$+) is one of the four Bell states defined by 

I<^>^) = ^(|00)±|11)) (57) 
l^^) = 4(|01)±|10)) (58) 



Example 2. 



= A|<l>+)($+| + (1 - A)|00)(00| (59) 

P2 =(1-^)|00)(00| + ^|11)(11| (60) 

E{a,)= ln.+ + ln._ - (1 - ^) Ml " ^) " (1 " ^) 1^(1 " \) , (61) 
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where 

1±^1-2A(1-|) 
s± = (62) 

are the eigenvalues of (72- One could argue that in the above two cases the following reasoning 
can be applied: (Ti(2) is a mixture of a maximally entangled state (for which the amount 
of entanglement is given by In 2) and a completely disentangled state {E = 0). Thus one 
would expect a total amount of entanglement of A In 2. It is curious that this reasoning does 
not work for either of the two states, since, in fact, £'((7i(2)) < A In 2. Now, we show how to 
use Theorem 4 to generate more states and their minima. For pure states o"^ = a we know 
the minimum p. Now, the state that is a convex sum of a and p should also have the same 
minimum p. So, 
Example 3. 

(73 = A|00)(00| + fi|00)(ll| + fi*|ll)(00| + (1 - A)|ll)(ll| (63) 
P3 =A|00)(00| + (1-A)|11)(11| (64) 
E(cT3)=e+lne+ + e_lne_-Aln^-(l-A)ln(l-A) , (65) 

where 

1±^1-4A{1-A)-\B\^ 
e± = . (66) 

Using Theorem 4, the amount of entanglement can be found for a number of other spin 1/2 
states. Our program can also help us infer the entanglement of some other non-trivial states 
as the last example shows. 
Example 4. 

(74= A|00)(00| + S|00)(11| + S*|11)(00| + (1 - 2^)|01)(01| + A|ll)(ll| (67) 
P4= C|00)(00| + £'|00)(11| + D*|11)(00| + E|01)(01| (68) 
+(1-2C-£;)|10)(10| + C|11)(11| , (69) 

where 
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{l-2A){l-Af 
{l-AY-B^ 
C=l-A-E 



(70) 



(71) 



D= ^E{1 -E-2C) 



{l-2A){l-A) 



B 



(72) 



It is now easy to compute the amount of entanglement from the above information. 

In addition to the above described methods there is a simple way of obtaining a lower 
bound for the amount of entanglement for any two spin 1/2 system. Suppose that we 
have a certain state a. We first find the maximally entangled state \ip) such that the 
fidelity F = {tp\a\ilj) is maximized. Then we apply local unitary transformations to a which 
transform into the singlet state (this is, of course, always possible). Now, we apply local 
random rotations to both particles. These will transform a into a Werner state, where 
the singlet state will have a weight F (since it is invariant under rotations) and all the other 
three Bell states will have equal weights of (1 — F)/3 (since they are randomized). Since 
these operations are local they cannot increase the amount of entanglement, and we have 
that for any a 



where Wp is the above described Werner state (the Relative Entropy of entanglement for a 
general Bell diagonal state is calculated in [||). 

We note that this efficient computer search provides an alternative criterion for deciding 
when a given state a of two spin 1/2 systems is disentangled, i.e. of the form given in 
eq. (^. The already existing criterion is the one given by Peres and Horodecki family (see 
second and third references in |jl[), which states that a state is disentangled iff its partial 
trace over either of the subsystems is a non-negative operator. This criterion is only valid 
for two spin 1/2, or one spin 1/2 and one spin 1 systems. In the absence of a more general 
analytical criterion our computational method provides a way of deciding this question. In 
addition we would like to point out that the program is also able to provide us with the 
convex decomposition of a disentangled state p. 



E{a) >E{Wf) =FlnF+(l-F)ln(l-F)+ln2 



(73) 
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At the end of this section we mention additivity as an important property desired from 
a measure of entanglement, i.e. we would like to have 

E{ai2 ® ^34) = E{ai2) + E{a^i) , (74) 

where systems 1 + 2 and systems 3 + 4 are entangled separately from each other. The exact 
definition of the left hand side is 

E{ai2 ® 0-34) = min 5(0-12 ® cr34 1 1 PiPls ® P24) • (75) 

% 

Why this form? One would originally assume that Oyi ® 034 should be minimized by the 
states of the form (J2iPiPi ® P2) ® (J^jPjPi ® Pa)- However, Alice, who holds systems 1 and 
3, and Bob, who holds systems 2 and 4, can also perform arbitrary unitary operation on 
their subsystems (i.e. locally). This obviously leads to the creation of entanglement between 
1 and 3 and between 2 and 4 and hence the form given in eq. (^^- Additivity is, of course, 
already true for the pure states, as can be seen from the proof above, when our measure 
reduces to the von Neumann entropy. For more general cases we were unable to provide an 
analytical proof, so that the above additivity property remains a conjecture. However, for 
two spin 1/2 systems, our program did not find any counter-example. It should be noted 
that it is easy to see that we have 

Eiau ® CT34) < E{au) + E{ctu) ■ (76) 

In the following we will assume that Eq. (^) holds and use it in Section V to derive certain 
limits to the efficiency of purification procedures. 



IV. STATISTICAL BASIS OF ENTANGLEMENT MEASURE 

Let us see how we can interpret our entanglement measure in the light of experiments, 
i.e. statistically. This was presented in [0 in a greater detail. Here we present a summary 
which is sufficient to understand the following Section. Our interpretation relies on the 



result concerning the asymptotics of the Quantum Relative Entropy first proved in , and 
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here presented under the name of Quantum Sanov's Theorem. We first show how the notion 
of Relative Entropy arises in classical information theory as a measure of distinguishability 
of two probability distributions. We then generalize this idea to the quantum case, i.e. to 
distinguishing between two quantum states (for a discussion of distinguishability of pure 



quantum states see e.g. We will see that this naturally leads to the notion of the 

Quantum Relative Entropy. It is then straightforward to extend this concept to explain the 
Relative Entropy of Entanglement. Suppose we would like to check if a given coin is "fair", 
i.e. if it generates a "head-tail" distribution of / = (1/2, 1/2). If the coin is biased then it will 
produce some other distribution, say uf = (1/3,2/3). So, our question of the coin fairness 
boils down to how well we can differentiate between two given probability distributions 
given a finite, n, number of experiments to perform on one of the two distributions. In 
the case of a coin we would toss it n times and record the number of O's and I's. From 
simple statistics we know that if the coin is fair than the number of O's A^(0) will be roughly 
n/2 — ^/n < N{0) < n/2 + ^/n, for large n and the same for the number of I's. So if our 
experimentally determined values do not fall within the above limits the coin is not fair. We 
can look at this from another point of view; namely, what is the probability that a fair coin 
will be mistaken for an unfair one with the distribution of (1/3,2/3) given n trials on the 
fair coin? For large n the answer is |]7|,[l9 



p(fair ^ unfair) = e-'^^^'^'^^H^) , (77) 

where (?^/ 1 1 /) = 1 /3 In 1/3 + 2/3 In 2/3 - 1 /3 In 1 /2 - 2/3 In 1 /2 is the Classical Relative 
Entropy for the two distributions. So, 

p(fair ^ unfair) = 3"2-t" , (78) 

which tends exponentially to zero with n — > oo. In fact we see that already after ~ 20 trials 
the probability of mistaking the two distributions is vanishingly small, < 10^^°. 

This result is true, in general, for any two distributions. Asymptotically the probability 
of not distinguishing the distributions P{x) and Q{x) after n trials is e~"'^'^'(^(^)ll'5(^)), where 
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Sci{P{x)\\Q{x)) = ^pilnpi- pilnqi (79) 

i 

(this statement is sometimes called Sanov's theorem To generalize this to quantum 

theory, we need a means of generating probability distributions from two quantum states a 
and p. This is accomplished by introducing a general measurement Ej J2i Ei = 1. So, the 
probabilities are given by 

Pi = tr{E}Eip) 

qi=tr{EjEia) . (80) 

Now, we can use eq. (|79|) to distinguish between a and p. The above is not the most general 
measurement that we can make, however. In general we have copies of cr and p in the 
state 

cr^ = (7(8)0- . .. ®a (81) 

total of N terms 
p^ = p®p...®p (82) 

total of N terms 

We may now apply a POVM I^jAj = U acting on and p^ . Consequently, we define a 
new type of relative entropy 

Sn{(t\\p) := supA'sl^E^^^i^^ln^^^i^^ -^^^^^^In^^^iP^} (83) 

i 

Now it can be shown that \TB 



S{a\\p)>SN (84) 

where, as before, 

S{a\\p) ■.= tr{a\na - a\np) (85) 

is the Quantum Relative Entropy (for the summary of the properties of the 

Quantum Relative Entropy see |T^). Equality is achieved in eq. iff a and p commute 
. However, for any a and p it is true that 



28 



S{a\\p) = lim Sn ■ 

N—>oo 

In fact, this limit can be achieved by projective measurements which are independent of 



a |2^. It is known that if eq. (|79|) is maximized over all general measurements E, the 
upper bound is given by the quantum relative entropy (see e.g. [0). In quantum theory we 
therefore state a law analogous to Sanov's theorem (see also 0), 

Theorem 8 (or Quantum Sanov's Theorem). The probability of not distinguishing two 
quantum states (i.e. density matrices) a and p after n measurements is 

p{p a) = e-^^^'ll") (86) 

In fact, as explained before, this bound is reached asymptotically and the measure- 



ments achieving this are global projectors independent of the state a |2^. We note that 



the Quantum Sanov Theorem was presented by Donald in as a definition justified by 



properties uniquely characterizing the quantity e"'"'^*^'^"''''. The underlying intuition in the 
above measurement approach and Donald's approach are basically the same. Now the in- 
terpretation of the Relative Entropy of Entanglement becomes immediately transparent [0 . 
The probability of mistaking an entangled state a for a closest, disentangled state, p, is 
^-nxminpe-pS{a,p) _ ^-nE{a) ^ j£ ^^iq amouut of entanglement of a is greater, than it takes fewer 
measurements to distinguish it from a disentangled state (or, fixing n, there is a smaller 
probability of confusing it with some disentangled state). Let us give an example. Consider 
a state (|00) + |ll))/v^, known to be a maximally entangled state. The closest to it is 
the disentangled state (|00)(00| + |ll)(ll|)/2 0. To distinguish these states it is enough to 
perform projections onto (|00) + |ll))/-\/2. If the state that we are measuring is the above 
mixture, then the sequence of results (1 for a successful projection, and for an unsuccessful 
projection) will contain on average an equal number of O's and I's. For this to be mistaken 
for the above pure state the sequence has to contain all n I's. The probability for that is 
2~", which also comes from using eq. (^). If, on the other hand, we performed projections 
onto the pure state itself, we would then never confuse it with a mixture, and from eq. ( |5BD 
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the probability is seen to be e °° = 0. We next apply this simple idea to obtaining an upper 
bound to the efficiency of any purification procedure. 

V. THERMODYNAMICS OF ENTANGLEMENT: PURIFICATION 

PROCEDURES 

There are two ways to produce an upper bound to the efficiency of any purification 
procedure. Using condition E3 and the fact that the Relative Entropy of Entanglement is 
additive, we can immediately derive this bound. However, this bound can be derived in 
an entirely different way. In this section we now abandon conditions E1-E3 and use only 
methods of the previous section to put an upper bound to the efficiency of purification 
procedures. In particular, we show that the entanglement of creation is in general larger 
than the entanglement of distillation. This is in contrast with the situation for pure states 
where both quantities coincide. The Quantum Relative Entropy is seen to play a distinctive 
role here, and is singled out as a 'good' generator of a measure of entanglement from among 
other suggested candidates. 

A. Distinguishability and Purification Procedures 

In the previous section we presented a statistical basis to the Relative Entropy of En- 
tanglement by considering distinguishability of two (or more) quantum states encapsulated 
in the form of the Quantum Sanov Theorem. We now use this Quantum Sanov Theorem to 
put an upper bound on the amount of entanglement that can be distilled using any purifi- 
cation procedure. This line of reasoning follows from the fact that any purification scheme 
can be viewed as a measurement to distinguish entangled and disentangled quantum states. 
Suppose that there exist a purification procedure with the following property 

• Initially there are n copies of the state a. If a is entangled, then the end product is 
< m < n singlets and n — m states in p e D. Otherwise, the final state does not 
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contain any entanglement, i.e. m = (in fact, there is nothing special about singlets: 
the final state can be any other known, maximally entangled state because these can 
be converted into singlets by applying local unitary operations). 



Note that we can allow the complete knowledge of the state a. We also allow that purification 
procedures differ for different states a. Perhaps there is a "universal" purification procedure 
independent of the initial state. However, in reality, this property is hard to fulfill 0. At 
present the best that can be done is to purify a certain class of entangled states, (see e.g. 
p8|-p0[). The above is therefore an idealization that might never be achieved. Now, by 
calculating the upper bound on the efficiency of a procedure described above we present 
an absolute bound for any particular procedure. We ask: 'What is the largest number of 
singlets that can be produced (distilled) from n pairs in state a" ? Suppose that we produce 
m pairs. We now project them non-locally onto the singlet state. The procedure will yield 
positive outcomes (1) with certainty so long as the state we measure indeed is a singlet. 
Suppose that after performing singlet projections onto all m particles we get a string of m 
I's. From this we conclude that the final state is a singlet (and therefore the initial state a 
was entangled). However, we could have made a mistake. But with what probability? The 
answer is as follows: the largest probability of making a wrong inference is 2"™" = e"*^'"^^ (if 
the state that we were measuring had an overlap with a singlet state of 1/2). On the other 
hand, if we were measuring a from the very beginning (without performing the purification 
first), then the probability (i.e. the lower bound) of the wrong inference would be e""^*^'^-'. 
But, purification procedure might waste some information (i.e. it is just a particular way of 
distinguishing entangled from disentangled states, not necessarily the best one), so that the 
following has to hold 



—m In 2 



(87) 



which implies that 



nE{a) > m , 



(88) 
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i.e. we cannot obtain more entanglement than is originally present. This, of course, is also 
directly guaranteed by our condition E3. The above, however, was a deliberate exercise 
in deriving the same result from a different perspective, abandoning conditions E1-E3. 
Therefore the measure of entanglement given in eq. (|^), when D{a\\p) = S{cx\\p), can be 
used to provide an upper bound on the efficiency of any purification procedure. For Bell 
diagonal states. Rains |^ found an upper bound on distillable entanglement using completely 
different methods. It turns out that the bound that he obtains in this case is identical to 
the one provided by the relative entropy of entanglement. 

Actually, in the above considerations we implicitly assumed that the entanglement of n 
pairs, equivalently prepared in the state a, is the same as n x E{(t). We already indicated 
that this is a conjecture with a strongly supported basis in the case of the Quantum Relative 
Entropy. Based on the upper bound considerations we can introduce the following definition. 
Definition 3. A purification procedure given by a local complete positive trace preserving 
map c ^ Vi^Vi is defined to be ideal in terms of efficiency iff 



where, as usual, = ViaV^ and pi = triViaVi) (i.e. a the ideal purification is the one where 
E3 is an equality rather than an inequality). Notice an apparent formal analogy between a 
purification procedure and the Carnot cycle in Thermodynamics. The Carnot cycle is the 
most efficient cycle in Thermodynamics (i.e. it yields the greatest "useful work to heat" 
ratio), since it is reversible (i.e. it conserves the thermodynamical entropy). We would now 
like to claim that the ideal purification procedure is the most efficient purification procedure 
(i.e. it yields the greatest number of singlets for a given input state), since it is reversible 
(i.e. it conserves entanglement, measured by the minimum of the Quantum Relative Entropy 
over all disentangled states). Unfortunately this analogy between the Carnot cycle and 
purification procedures is not exact (it is only strictly true for the pure states). This is seen 
when we compare the entanglement of creation with the Relative Entropy of Entanglement. 
In Theorem 6 we have, in fact, shown that the entanglement of creation is never smaller 




(89) 
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than the Relative Entropy of Entanglement. As an example one can consider Bell diagonal 
states for which we can exactly calculate both the entanglement of creation ^ and the 
Relative Entropy of Entanglement It turns out that the entanglement of creation is 
always strictly larger than the Relative Entropy of Entanglement except for the limiting 
cases of maximally entangled Bell states or of disentangled Bell diagonal states (see Fig. 2 
for Werner states). This result leads to the following 

Implication. In general, the amount of entanglement that was initially invested in creation 
of (J cannot all be recovered ( "distilled" ) by local purification procedures. 
Therefore, the ideal purification procedure, though most efficient, is nevertheless irreversible, 
and some of the invested entanglement is lost in the purification process itself. The solution 
to this irreversibility lies in the loss of certain information as can easily be seen from the 
following analysis. Suppose we start with an ensemble of singlets and we want to locally 
create any mixed state a. Now a can always be written as a mixture of pure states \E'i, ^^2, ••• 
with the corresponding probabilities pi,p2,... We now use Bennett et al's (de)purification 
procedure for pure states (whose efficiency is governed by the von Neumann entropy). 
We convert the first pi x N singlets into the state \E'i, second p2 x N singlets into the state 
\E'2, and so on... In this way, the whole ensemble is in the state a. But, we have an additional 
information: we know exactly that the first pi x N pairs are in the state second p2 x N 
states are in the state "^2, and so on. This is not the same as being given an initial ensemble 
of identically prepared pairs in the state sigma without any additional information. In this, 
second, case we do not have the additional information of knowing exactly the state of each 
of the pairs. This is why the purification without this knowledge is less efficient, and hence 
one expects that the Relative Entropy of Entanglement is smaller than the entanglement of 
formation. 

An open question remains as to whether we can use some other generator, such as the 
Bures Metric, to give an even more stringent bound on the amount of distillable entangle- 
ment. 
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B. More Than two Subsystems 



We see that the above treatment does not refer to the number (or indeed dimensionahty) 
of the entangled systems. This is a desired property as it makes our measure of entanglement 
universal. However, in order to perform minimization in eq. (|^ we need to be able to define 
what we mean by a disentangled state of say particles. As pointed out in we believe 
that this can be done inductively. Namely, for two quantum systems, Ai and A2, we define 
a disentangled state as one which can be written as a convex sum of disentangled states of 
Ai and A2 as follows [§],[7[: 

Pl2 = T.P^Pf'®Pf" ' (90) 
i 

where J2iPi = 1 and the p's are all positive. Now, for entangled systems Ai, A2, ...An, 
the disentangled state is: 

perm{iii2...ijv} 

where Eperm{n^2...^^r} '^ni2...iiv = all r's are positive and where Eperm{ni2...iiv} a sum 
over all possible permutations of the set of indices {1, 2, A^}. To clarify this let us see 
how this looks for 4 systems: 

P1234 = E^ P^ Pf'^'^' ® Pf' + q^ pf'^'^' ® pf' 

+ npf^^'^'^pf^ + s,pf^^'^'^pf^ 
+ U pf'^' ® pf'^' + pf^^' ® pf'^' 

+ v,pf''''®pf'^' (92) 

where, as usual, all the probabilities Pi,qi,...,Vi are positive and add up to unity. The 
above two equations, at least in principle, define the disentangled states for any number of 
entangled systems. Note that this form describes a different situation from the one given 
in eq. ( |75D which refers to a number of pairs shared by Alice and Bob only. The above 
definition of a disentangled state is justified by extending the idea that local actions cannot 
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increase the entanglement between two quantum systems 0,^,0]. In the case of particles 
we have parties (Alice, Bob, Charlie, ... , Wayne) all acting locally on their systems. The 
general action that also includes communications can be written as 0| 

P — E^ii ® ® ■■■ ® W,,pA ® Bl ... ® Wl (93) 

n,«2,...,/jv 

and it can be easily seen that this action does not alter the form of a disentangled state 
in eqs. ( pl] , p2|) . In fact, eq. (|9ll) is the most general state invariant in form under the 
transformation given by eq. (^). This can be suggested as a definition of a disentangled 
state for A^ > 3, i.e. it is the most general state invariant in form under local POVM and 
classical communications. Of course, an alternative to defining a disentangled state would 
be 

i 

which means that we do not allow any entanglement in any subset of the A^ states. This 
would be a disentangled state based on some local hidden variable model. Again we repeat 
that the particular choice of a form of disentangled states will depend on the physical back- 
ground in our model and there is no absolute sense in which we can resolve this dichotomy. 
It should be stressed that for two particles this free choice does not exist as both pictures 
coincide. 

VI. CONCLUSIONS 

We can look at the entanglement from two different perspectives. One insists that local 
actions cannot increase entanglement and do not change it if they are unitary. The other one 
looks at the way we can distinguish an entangled state from a disentangled one. In particu- 
lar, the following question is asked: what is the probability of confusing an entangled state 
with a disentangled one after performing a certain number of measurements? These two, 
at first sight different approaches, lead to the same measure of entanglement. This results 
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in the fact that a purification procedure can be regarded as a protocol of distinguishing an 
entangled state from a disentangled set of states. From this premise we derived the upper 
bound on the efficiency of any purification procedure. It turns out that distillable entangle- 
ment is in general smaller than the entanglement of creation. Our entanglement measure 
is independent on the number of systems and their dimensionality. This suggests applying 
it to more than two entangled systems in order to understand multi-particle entanglement. 
We have shown how to compute entanglement efficiently for two spin 1/2 subsystems using 
computational methods. However, a closed form for the expression of this entanglement 
measure is desirable. However, a closed form for the entanglement of formation has been 
proposed for two spin 1/2 particles in [Q. An interesting problem is to specify all the states 
that have the same amount of entanglement. We know that all the states that are equivalent 
up to a local unitary transformation have the same amount of entanglement (by definition- 
E2). However, there are states with the same amount of entanglement but which are not 
equivalent up to a local unitary transformation (for example one state is pure and the other 
one is mixed). A question for further research is whether they are linked by a local complete 
measurement. Our work in addition suggest a question of finding a general local map that 
preserves the entanglement of a given entangled state. 
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APPENDIX A: ANOTHER PROOF FOR THE PURE STATE ENTANGLEMENT 



In the following we present a third proof for the value of the Relative Entropy of Entan- 
glement for pure states. As in the second proof we use the representation of the logarithm 
of a density operator in terms of a complex integral as in Eq. (^)-(^). We would like to 
know the value of the Relative Entropy of Entanglement for a pure state a = with 
1^) = a|00) + We assume that p = a'^\00){00\ + p^\n){ll\ is the closest disentangled 

state to a. Therefore we would have that 

Eia) = Sia\\p) . (Al) 

Assume that we change p a little bit, i.e. we have 

Px = il-X)p + Xp* (A2) 

with a small A such that p\ and p* are disentangled. For p to be the closest disentangled 
state to a we have to have that 

^^(a||(l-A)p + Ap*)U=o>0 . (A3) 

Using the complex representation of Eq. ( |5TD for the derivative of the logarithm we quickly 
find 

^^(a||(l - A)p + Ap*)U=o = -^tr{a\n[{l - A)p + Ap*]}U=o 

d 1 . , , , 



(f dztr\a— jln; 

— (f dztr Up* - p)(zl- p)-^ a (zl- p)-H In z 
7ii J 



2711 

= 1 -tr{p*(|00)(00| + |11)(11| +x|00)(ll| +x|ll)(00|) (A4) 

where x = aj3 (In — In (3"^) / {a^ — jS"^) and we have used the explicit form of a and p together 
with Cauchy's theorem Now we have to show that Eq. (|A4|) is always positive. One 



easily checks that 
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X = ap{\n - In p^)/{a^ - P^) < 1 (A5) 

where the maximum is achieved for = 1/2. The right hand side of Eq. ( [A4D can become 
smallest for a; = 1. For Eq. (^) to be positive we therefore need to show that 

tr{p*(|00)(00| + |11)(11| + |00)(11| + |11)(00|) < 1 (A6) 

Using 10+) = (|00) + \n))/V2 this follows easily as p* is not entangled and therefore 
(0+|p*|0+) < 1/2 which immediately confirms Eq. |A6| . Therefore p indeed represents 
the closest disentangled state to a and our proof is complete. 

This proof can easily be extended to arbitrary dimensional subsystems where the maxi- 
mally entangled states have the form I]„ In that case the proof becomes more similar 
to the one presented in section II. 
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FIGURE CAPTIONS 



Figl. The set of all density matrices, T is represented by the outer circle. Its subset, 
a set of disentangled states T) is represented by the inner circle. A state a belongs to the 
entangled states, and p* is the disentangled state that minimizes the distance D{(t\\p), thus 
representing the amount of quantum correlations in cr. State p\ ® p*^ is obtained by tracing 
p* over A and B. D{p*\\p\ ® p*^) represent the classical part of the correlations in the state 
a. 

Fig2. Comparison of the entanglement of creation and the Relative Entropy of En- 
tanglement for the Werner states (these are are Bell diagonal states of the form W = 
diag(F, (1 — F)/3, (1 — -F)/3, (1 — F)/3.) One clearly sees that the entanglement of creation 
is strictly larger than the Relative Entropy of Entanglement for < F < 1 
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FIGURES 




FIG. 1. The set of all density matrices, T is represented by the outer circle. Its subset, a set of 
disentangled states P is represented by the inner circle. A state a belongs to the entangled states, 
and p* is the disentangled state that minimizes the distance D{a\\p)^ thus representing the amount 
of quantum correlations in a. State p\®Pb is obtained by tracing p* over A and B. D{^p*\\p\® p*^) 
represent the classical part of the correlations in the state a. 
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FIG. 2. Comparison of the entanglement of creation and the Relative Entropy of 
Entanglement for the Werner states (these are are Bell diagonal states of the form 
W = diag(F, (1 — F)/3, (1 — F)/3, (1 — F)/3.) One clearly sees that the entanglement of creation 
is strictly larger than the Relative Entropy of Entanglement for < F < 1 
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